Search CORE

26 research outputs found

BayOTIDE: Bayesian Online Multivariate Time series Imputation with functional decomposition

Author: Fang Shikai
Sun Liang
Wen Qingsong
Zhe Shandian
Publication venue
Publication date: 28/08/2023
Field of study

In real-world scenarios like traffic and energy, massive time-series data with missing values and noises are widely observed, even sampled irregularly. While many imputation methods have been proposed, most of them work with a local horizon, which means models are trained by splitting the long sequence into batches of fit-sized patches. This local horizon can make models ignore global trends or periodic patterns. More importantly, almost all methods assume the observations are sampled at regular time stamps, and fail to handle complex irregular sampled time series arising from different applications. Thirdly, most existing methods are learned in an offline manner. Thus, it is not suitable for many applications with fast-arriving streaming data. To overcome these limitations, we propose \ours: Bayesian Online Multivariate Time series Imputation with functional decomposition. We treat the multivariate time series as the weighted combination of groups of low-rank temporal factors with different patterns. We apply a group of Gaussian Processes (GPs) with different kernels as functional priors to fit the factors. For computational efficiency, we further convert the GPs into a state-space prior by constructing an equivalent stochastic differential equation (SDE), and developing a scalable algorithm for online inference. The proposed method can not only handle imputation over arbitrary time stamps, but also offer uncertainty quantification and interpretability for the downstream application. We evaluate our method on both synthetic and real-world datasets

arXiv.org e-Print Archive

Dynamic Tensor Decomposition via Neural Diffusion-Reaction Processes

Author: Fang Shikai
Li Shibo
Wang Zheng
Zhe Shandian
Publication venue
Publication date: 30/10/2023
Field of study

Tensor decomposition is an important tool for multiway data analysis. In practice, the data is often sparse yet associated with rich temporal information. Existing methods, however, often under-use the time information and ignore the structural knowledge within the sparsely observed tensor entries. To overcome these limitations and to better capture the underlying temporal structure, we propose Dynamic EMbedIngs fOr dynamic Tensor dEcomposition (DEMOTE). We develop a neural diffusion-reaction process to estimate dynamic embeddings for the entities in each tensor mode. Specifically, based on the observed tensor entries, we build a multi-partite graph to encode the correlation between the entities. We construct a graph diffusion process to co-evolve the embedding trajectories of the correlated entities and use a neural network to construct a reaction process for each individual entity. In this way, our model can capture both the commonalities and personalities during the evolution of the embeddings for different entities. We then use a neural network to model the entry value as a nonlinear function of the embedding trajectories. For model estimation, we combine ODE solvers to develop a stochastic mini-batch learning algorithm. We propose a stratified sampling method to balance the cost of processing each mini-batch so as to improve the overall efficiency. We show the advantage of our approach in both simulation study and real-world applications. The code is available at https://github.com/wzhut/Dynamic-Tensor-Decomposition-via-Neural-Diffusion-Reaction-Processes

arXiv.org e-Print Archive

Analysis of Multivariate Scoring Functions for Automatic Unbiased Learning to Rank

Author: Ai Qingyao
Fang Shikai
Li Shibo
Wang Yulan
Yang Tao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/08/2020
Field of study

Leveraging biased click data for optimizing learning to rank systems has been a popular approach in information retrieval. Because click data is often noisy and biased, a variety of methods have been proposed to construct unbiased learning to rank (ULTR) algorithms for the learning of unbiased ranking models. Among them, automatic unbiased learning to rank (AutoULTR) algorithms that jointly learn user bias models (i.e., propensity models) with unbiased rankers have received a lot of attention due to their superior performance and low deployment cost in practice. Despite their differences in theories and algorithm design, existing studies on ULTR usually use uni-variate ranking functions to score each document or result independently. On the other hand, recent advances in context-aware learning-to-rank models have shown that multivariate scoring functions, which read multiple documents together and predict their ranking scores jointly, are more powerful than uni-variate ranking functions in ranking tasks with human-annotated relevance labels. Whether such superior performance would hold in ULTR with noisy data, however, is mostly unknown. In this paper, we investigate existing multivariate scoring functions and AutoULTR algorithms in theory and prove that permutation invariance is a crucial factor that determines whether a context-aware learning-to-rank model could be applied to existing AutoULTR framework. Our experiments with synthetic clicks on two large-scale benchmark datasets show that AutoULTR models with permutation-invariant multivariate scoring functions significantly outperform those with uni-variate scoring functions and permutation-variant multivariate scoring functions.Comment: 4 pages, 2 figures. It has already been accepted and will show in Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM '20), October 19--23, 202

arXiv.org e-Print Archive

Crossref

The adaptation of Arctic phytoplankton to low light and salinity in Kongsfjorden (Spitsbergen)

Author: Fang Zhang
Jianfeng He
Ling Lin
Peimin He
Shikai Cui
Yuxin Ma
Publication venue: Polar Research Institute of China - PRIC
Publication date: 01/03/2012
Field of study

The basic environmental variables and adaptability of phytoplankton communities to low light and salinity were studied using incubation experiments in Kongsfjorden, a high Arctic fjord of Spitsbergen, in late summer 2006. Chlorophyll a concentrations were steady or decreased slightly in darkness after one day or one week incubation. Chlorophyll a concentrations showed an initial decline when exposed to natural light after one week incubation in darkness, and then increased significantly. In a salinity experiment, the maximal growth rate was observed at a dilution ratio of 10%, however, higher dilution ratios (≥40%) had an obvious negative effect on phytoplankton growth. We suggest that the phytoplankton communities in fjords in late summer are darkness adapted, and the inflow of glacial melt water is favorable for phytoplankton growth in the outer fjords where the influence of freshwater is limited

Arctic Portal Library

Provably Convergent Schr\"odinger Bridge with Applications to Probabilistic Time Series Imputation

Author: Chen Yu
Deng Wei
Fang Shikai
Li Fengpei
Nevmyvaka Yuriy
Rasul Kashif
Schneider Anderson
Yang Nicole Tianjiao
Zhang Yikai
Zhe Shandian
Publication venue
Publication date: 03/06/2023
Field of study

The Schr\"odinger bridge problem (SBP) is gaining increasing attention in generative modeling and showing promising potential even in comparison with the score-based generative models (SGMs). SBP can be interpreted as an entropy-regularized optimal transport problem, which conducts projections onto every other marginal alternatingly. However, in practice, only approximated projections are accessible and their convergence is not well understood. To fill this gap, we present a first convergence analysis of the Schr\"odinger bridge algorithm based on approximated projections. As for its practical applications, we apply SBP to probabilistic time series imputation by generating missing values conditioned on observed data. We show that optimizing the transport cost improves the performance and the proposed algorithm achieves the state-of-the-art result in healthcare and environmental data while exhibiting the advantage of exploring both temporal and feature patterns in probabilistic time series imputation.Comment: Accepted by ICML 202

arXiv.org e-Print Archive

The adaptability of three Arctic microalgae to different low temperatures

Author: Fang Zhang
Jianfeng He
Lihua Xia
Rumin Zhang
Shikai Cui
Yan Gao
Publication venue: Polar Research Institute of China - PRIC
Publication date: 01/12/2010
Field of study

In order to study the adaptability of Arctic microalgae to different environmental temperatures, the growth curves and antioxidase system of three microalgae (Skeletonema marinoi, Chlorella sp. and Chlamydomonas sp.) that were separated from the Ny-Ålesund, the high Arctic, at different low temperatures (0°C, 4°C and 8°C) were determined. The result showed that the adaptability of the microalgae to temperatures depended on the species. The growth rate, SOD and CAT activities of Skeletonema marinoi were the highest at 4°C, but MDA content was the lowest. The growth rate and enzyme activity of Chlorella sp. were the highest at 8°C, while the lowest MDA content presented at 0°C. The growth of Chlamydomonas sp. at the different temperatures was not so significant, the lowest MDA content presented at 8°C. The change of antioxidase system also depended on species and temperatures. Three indexes of antioxidase system of Skeletone mamarinoi between 0°C and 4°C showed extremely significant difference (p <0.01).SOD activity of Skeletonema marinoi and Chlorella sp. between 0°C and 8°C showed significant difference (p<0.05), and the other two indexes of them differed insignificantly. Antioxidase systems of Chlamydomonas sp. at the three temperatures differed insignificantly. In conclusion, the three microalgae had good adaptability to the three temperatures; their MDA content presented a low level, and had unique physiological mechanism to adapt to the environment with different low temperatures

Arctic Portal Library

Genome-wide identification and expression analysis of 3-ketoacyl-CoA synthase gene family in rice (Oryza sativa L.) under cadmium stress

Author: Bo Xu
Changkai Lai
Fengli Zhao
Gaoneng Shao
Guiai Jiao
Jingxin Wang
Junchao Fang
Liang Zhou
Lihong Xie
Ling Wang
Lingwei Yang
Peisong Hu
Shaoqing Tang
Shikai Hu
Suozhen Hui
Xiangjin Wei
Ying Chen
Yuanyuan Zhang
Yujuan Chen
Zhonghua Sheng
Publication venue: Frontiers Media S.A.
Publication date: 01/07/2023
Field of study

3-Ketoacyl-CoA synthase (KCS) is the key rate-limiting enzyme for the synthesis of very long-chain fatty acids (VLCFAs) in plants, which determines the carbon chain length of VLCFAs. However, a comprehensive study of KCSs in Oryza sativa has not been reported yet. In this study, we identified 22 OsKCS genes in rice, which are unevenly distributed on nine chromosomes. The OsKCS gene family is divided into six subclasses. Many cis-acting elements related to plant growth, light, hormone, and stress response were enriched in the promoters of OsKCS genes. Gene duplication played a crucial role in the expansion of the OsKCS gene family and underwent a strong purifying selection. Quantitative Real-time polymerase chain reaction (qRT-PCR) results revealed that most KCS genes are constitutively expressed. We also revealed that KCS genes responded differently to exogenous cadmium stress in japonica and indica background, and the KCS genes with higher expression in leaves and seeds may have functions under cadmium stress. This study provides a basis for further understanding the functions of KCS genes and the biosynthesis of VLCFA in rice

Directory of Open Access Journals

Investigation and Analysis of Guangzhou Nansha Coast Park Point Source Pollution and Non-point Source Pollution

Author: Chen Budan
Chen Fang
Wu Shikai
Yang Ruijie
Zeng Huanghuang
Publication venue
Publication date: 01/04/2017
Field of study

[Objective]To find out the situation of Nansha Coast Park point and non-point source pollution. [Method] By investigating the park water environment analysis of point and non-point source pollutants contribution rate, setting up water quality monitoring sites for basic data related indicators and then using national water quality standards to evaluate water quality.[Result]The Coast Park point source pollution mainly comes from the campus greeting fertilizer spraying. The COD of lakes and river outside the park and ammonia mean concentration belong to grade III. The total nitrogen of lake belongs to grade III. The total phosphorus belongs to grade IV. The total nitrogen of river is the worst. The total phosphorus is grade V. [Conclusion] The lake water quality is highly affected by the point and non-point source pollution, the quality of the river is worse than that of the lake in the park, and it needs powerful governance

AgEcon Search - Research in Agricultural & Applied Economics

AgEcon Search: Research in Agricultural and Applied Economics

In vitro remineralization of enamel white spot lesions with a carrier-based amorphous calcium phosphate delivery system

Author: He Hong
Hua Fang
Yan Jiarong
Yang Hongye
Zhao Shikai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The University of Manchester - Institutional Repository